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I.  INTRODUCTION 

We  shall  be  interested  in  estimating  states,  x and  b,  for  the 
following  (continuous-  or  discrete-time)  system,  which  is  assumed  to  be 
the  actual  system  or  truth  model: 


0(x)  = Ax  + Bb  + ; x(0) 

b = 0 ; b(0)  = b 


b(k+l)  = b(k)  = b 


£(*  I yQ)  = Hx  + Cb  + l 


In  these  equations,  we  use  operator  0(x)  to  denote  either  x(t)  or  x(k+l), 
and  we  do  not  show  the  explicit  dependence  of  vector  and/or  matrix 
quantities  on  time.  In  this  manner,  we  are  able  to  present  results  for 
continuous -time  and  discrete-time  systems  simultaneously.  The 
conditioning  notation  on  the  measurement  is  to  remind  us  of  the  model 
which  is  associated  with  the  measurement;  £( . | will  be  referred  to 
as  the  actual  measurement.  Models  other  than  will  be  introduced 
below.  In  x « R , A e R ,bcR,B«R  , ?cB  , pR  , 

H c R*Xn,  C c RSXr,  and  T^e  R*;  and,  ^£(  • ) and  T|  ( • ) are  gaussian  white 
noise  processes  for  which 


E( 5 } = 0 and 

eCti  } = o 

(4) 

§_'(T)}  = Q «(t-T) 

or  E[  |.(k)5'(i)}  = Q 

(5) 

ECl(t)  T|'(t)}  = R 5(t-r) 

or  Ef  2(k)  1 (Z) } = R 5^ 

(6) 
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* 

and 

I 1 21  (7) 

Friedland  (Ref.  1)  describes  a very  interesting  and  practical 

procedure  for  estimating  the  dynamical  states,  x,  and  the  constant,  but 

unknown  bias  states,  lj,  for  system  J . He  shows  that  the  optimum 

estimate  of  x,  x , and  its  associated  error  covariance  matrix,  P , can 
~ — x 

be  expressed  as 


and 


♦ * 

X = X + vxb 

P = P + V P,  V’ 
x X-  x b x 


(8) 

(9) 


where  x is  the  bias-free  estimate  of  x,  computed  as  if  no  biases  were 
* 

present,  b is  the  optimum  estimate  of  die  bias,  and  V is  a matrix  which 

blends  the  estimates  x and  la  together  to  give  x , the  bias  corrected 

estimate  of  x.  Matrices  P,  and  P are  defined  in  Section  II. 

- b x2 

In.  Friedland's  derivation,  he  augments  the  bias  states  to  the 

dynamical  states  and  shows  that  the  resulting  Riccati  equation  decomposes 
* 

so  that  x is  computed  via  Eq.  (8).  This  derivation  is  algebraic  in  nature, 
and  unfortunately,  cannot  be  applied  (or,  extended)  to  more  general 
dynamical  systems  (i.e.  , systems  where  la  is  time  varying).  For  this 
reason  we  present  an  alternative  approach  to  Friedland's  derivation.  We 
assume,  at  the  onset,  the  existence  of  the  decomposition  for  x in  Eq.  (8); 

* A 

assume  a specific  structure  for  b;  require  that  x and  b be  unbiased 

estimators  of  x and  b,  respectively;  and  minimize  the  trace  of  the 

error  covariance  matrix  of  x and  b , with  respect  to  a design  matrix 

which  appears  in  our  assumed  estimator  of  Is . We  then  demonstrate  that 
* * 

the  resulting  x and  b^  are  indeed  the  minimum- variance  estimators  of  x 

ii  a =**>  e { _5.  n* } = °* 


I 


* 


|i 

* 

- 

1 1 


j 
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and  b;  hence,  our  assumed  decomposition  of  x is  a valid  one. 

Our  derivation  method,  which  is  patterned  after  the  Athans  and  Tsc 
derivation  of  the  Kalman  filter  (Ref.  13),  is  constructive  in  nature,  which 
is  important  because  it  affords  us  the  possibility  of  extending  Friedland's 
results  to  more  general  classes  of  partitioned  dynamical  systems.  Such 
extensions,  which  are  beyond  the  scope  of  the  present  paper,  will  be 
described  in  a subsequent  paper. 

Some  extensions  of  and  work  related  to  Friedland's  decomposition 
can  be  found  in  Refs.  2,  3,  5,  6,  7,  8,  9,  and  19.  Agee  and  Turner 
(Ref.  5)  augmented  the  bias  states  to  the  dynamical  states  and  studied 
conditions  under  which  x can  be  decomposed  as  in  Eq.  (8).  They  showed 
that  there  are  three  basic  requirements  for  a general  linear  filter  to  be 
decomposed  as  in  Eq.  (8):  (1)  must  satisfy  a specific  equation; 

(2)  a gain  condition  of  the  form  K*  3 K~  + V^K£  must  exist  (the  gains  in 
this  equation  are  related  in  an  obvious  way  to  estimates  x»  and  6);  an(j 

(3)  Eq.  (8)  must  be  true  for  all  t (k)  including  t 3 0 (k  = 0).  Lin  and  Sage 

(Ref.  2)  formulate  Friedland's  problem  in  a different  manner.  In  their 

approach,  the  bias  states  are  thought  of  as  unknown  constant  mean  values 

of  plant  and  measurement  noise  processes.  The  dynamical  states  are 

estimated  as  usual,  whereas  die  unknown  means  are  estimated  using  the 

maximum-likelihood  formulation.  Godbole  (Ref.  3)  has  shown  that  Lin 

and  Sage's  results  are  identical  to  Friedland's.  , Bierman  (Ref.  19)  has 

obtained  Friedland's  decomposition  using  the  square -root  information 

filter,  and,  has  extended  Friedland's  results  to  smoothing.  Additionally, 

he  gives  matrix  V the  interesting  and  useful  interpretation  of  a 

* 

sensitivity  matrix;  l.e.  , V = grad  x. 

X A " 

b 

In  some  applications,  the  bias  states  are  used  to  model  constant 
instrumentation  error  sources  (Ref.  15),  of  which  there  can  be  a large 
number  (e.  g.  , 50).  It  can  happen  that  not  all  of  the  error  sources  are 
significant,  so  that  some  of  them  should  be  deleted  from  the  final  filter; 


-4- 


or,  Chat  significant  error  sources  may  have  been  initially  neglected,  and 

should  be  included  in  the  final  filter.  We  are  also  interested  in  knowing 

whether  or  not  it  is  possible  to  extend  Friedland's  decomposition  t$  the 

situation  where  b is  of  variable  dimension;  but,  in  such  a manner  that  all 

* ^ 

preceding  calculations  associated  with  obtaining  x(k)  and  x(k),  for  b of 
dimension  r^,  do  not  have  to  be  completely  redone  when  b becomes  of 
dimension  r where  r^  > r^  (the  case  r^  < r^  is  similar,  but  is  not 
discussed  in  this  paper).  This  extension  is  described  in  Section  IV. 

Bierman  (Ref.  19)  has  obtained  similar  results,  using  a square-root 
information  filter  approach. 

Some  important  preliminary  results  are  presented  in  Section  II. 

A parallel  development  of  both  the  discrete-  and  continuous -time 
decomposition,  Eqs.  (8)  and  (9),  using  our  constructive  approach,  is  given 
in  Section  III.  The  extension  of  Friedland's  decomposition  to  the  situation 
where  b is  of  variable  dimension  is  discussed  in  Section  IV.  Proofs  for 
all  theorems  are  given  in  appendices. 

n.  PRELIMINARY  RESULTS 

Recall  that  x in  Eq.  (8)  is  the  bias -free  estimate  of  x,  computed 
as  if  no  biases  were  present.  It  is  obtained  by  suppressing  die  bias  states, 
b,  in  and  applying  the  following  Kalman  filter  to  the  resulting  system:* 

O(x)  = (A-KxH)x+  KxI(.|V0)  (10) 

For  discrete-time  systems,  Kx  = Kx(k+1)  and  H = H(k+  1);  matrix  Kx 
is  the  Kalman  gain  matrix  for  calculation  of  x . 

To  facilitate  our  development  of  Eqs.  (8)  and  (9),  it  is  most 
convenient  to  Introduce  the  following  artificial  system,  j/ 

For  discrete-time  systems,  change  (A-KxH)  in  Eq.  (10)  to  (A-KxHA). 
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mSL 


O^)  = A x 2 + % ; 

£<•  \JZ)  = Hx  2 + T1 


x2(°) 


(ID 

(12) 


where  « Rn  and  all  other  quantities  are  as  defined  for  ^ Only  the 
actual  measurement,  £(  • | Vg)  exists;  artificial  measurement  y(  • | J^)  is 
non-existent,  but,  is  useful  for  analysis  purposes.  Though  we  shall  use 
in  our  analyses,  it  will  not  appear  in  any  of  our  final  results. 

A 

Letx,  denote  the  estimate  of  artificial  state  vector  x,  using  the 
artificial  measurement  £(  • |V^)l  i.  e.  , 

0(£2)  = (A-KxH)x2+  Kxy(.|y2)  (13) 


Observe  that  the  Kalman  gain  matrix  for  calculation  of  x is  exactly  the 

™ M 

* 

same  as  for  the  calculation  of  x , since  the  well-known  equations  for  the 
Kalman  gain  matrix  (and  associated  Riccati  equations)  do  not  depend  on 
the  measurement  process. 

The  following  error  processes  and  error  covariance  matrices  will 
be  used  during  our  developments: 


For  discrete-time  systems. 


change  (A  - Kx  H)  in  Eq.  (13)  to  (A  - K^  HA). 
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P = E{  x,  x'_  } 

x2  —2e  -2e 


P = E{  b x'  } 
bx^  — e — 2e 


For  discrete- time  systems,  all  error  covariances,  as  well  as  estimates 
are  for  filtered  quantities  (i.  e.  , x = x(k|k)  and  P^  = P^(k|k)),  unless 
otherwise  noted. 

The  following  theorems,  which  are  proved  in  Appendix  A,  connect 

M * 

the  states  x and  x^,  and  estimators  x and  x^- 

Theorem  1.  If  x(0)  = x,(0)and  A isnxr,  satisfying 


O(A)  = AA  + B ; A(0)  = 0 , 


x = x 2 + A b 


Theorem  2.  If  x(0)  = x,(0)and  A is  nXr,  satisfying  (for 


continuous -time  systems) 


A = (A  - Kx  H)  A + (HA  + C) 


or  (for  discrete-time  systems) 


A(k+1)  = [I  - KxH]A  A(k)  + Kx  C HA  (k+  1)  + C(k+  1)] 


where  A(0)  = 0,  then 


x * x 2 + A b 


Matrices  A and  A are  connected,  as  we  show  in: 
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Theorem  3.  Let  V = A - A.  Then  V__  satisfies  the  following 
equation  (for  continuous -time  systems) 


V = (A  - K H)  V + (B  - K C) 

X X X X 


(21a) 


or  (for  discrete -time  systems) 


V (k+1)  = [I  - K H][  A V (k)  + B]  - K C(k+  1) 

X X X X 


(2  lb) 


where  V (0)  = 0. 
x 

Matrix  V^,  in  Theorem  3,  is,  as  we  show  below,  precisely  the 
same  matrix  which  appears  in  Eqs.  (8)  and  (9). 


in.  DERIVATION  OF  FRIEDLAND'S  DECOMPOSITION 


A.  Introduction 

In  this  section,  we  assume  that  x and  b can  be  estimated  from  the 
estimators 


x = x + V b 
— — x — 


0(b)  = Cl  b + C2  y(.  \yQ)  + C3  x 


where  x is  the  bias-free  estimate  of  x,  defined  in  Section  II,  and  G^,  G2, 
Gj  (which  are  matrices  of  appropriate  dimensions)  and  V are  determined 

A A 

such  that:  (1)  x and  b are  unbiased  estimates  of  x and  b,  respectively, 

A A 

and  (2)  the  trace  of  the  error  covariance  matrix  for  x and  b is  minimized. 
We  then  show  that  the  resulting  estimators  are  the  optimal  estimators  of 
x and  b. 


rsgaenCTis.Ta. 


B.  Unbiasedness  of  the  Estimators 

The  following  two  theorems  demonstrate  how  G.,  G and  V must 
~ 1 3 x 

be  chosen  so  that  x and  b , given  by  Eqs . (22)  and  (23),  respectively,  are 

unbiased  estimators  of  x and  b.  Proofs  of  these  theorems  are  given  in 

Appendix  B. 

Theorem  4.  b is  an  unbiased  estimator  of  b,  iff 


E{b(0)}  = b 

and  (for  continuous-time  systems) 

Gi  = "C2  <nvx  + c) 

C3  = -C2  h 

or  (for  discrete-time  systems) 

GL(k+l)  = I - G2(k+1)  S(k+1) 
G3(k+1)  = -G2(k+1)HA 

where 

S(k+1)  = HAV^  + HB  + C(k+1 ) 


(24) 


(2  5a) 
(26a) 


(2  5b) 
(26b) 

(27) 


Theorem  5.  If  S is  an  unbiased  estimator  of  b and  V is  as 
* — x 

given  in  Eq.  (21),  then  x is  an  unbiased  estimator  of  x. 

Theorems  4 and  5 show  that  if  is  computed  in  a specific 
manner,  and  (for  continuous -time  systems) 


g2C  y(t|  JQ)  - (HVx+  C)  b - Hx  ] 


(28a) 
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or  (for  discrete-time  systems) 

b (k+1)  = [I  - G2(k+1)  S(k+1)]  b(k) 

+ G2(k+1)[  y(k|yQ)  - HA  x ] , (28b) 

A A f 

then,  x and  b are  unbiased  estimators  of  x and  b in  Observe  that  V 

— — — — 0 x 

(which,  as  we  pointed  out  in  Section  I,  is  a sensitivity  matrix)  maintains 

A 

die  unbiasedness  of  the  estimator  x,  whereas  and  G^»  chosen  as  the 
preceding  functions  of  C^,  maintain  the  unbiasedness  of  the  estimator  b. 
The  only  remaining  matrix  of  design  variables  is  Q . 

w 


C.  Optimal  Gain  Matrix,  G^ 


Matrix  G2  is  chosen  such  that  the  trace  of  the  error  covariance 

A A - 

matrix  for  the  x and  b estimators  is  minimized.  That  matrix,  for  «</  , 

— — 0 

denoted  P(  • ),  has  been  defined  in  Eq.  (15). 

Theorem  6.  If  P (0)  = 0,  then 

OX— 


¥t  or  Vk 


(29) 


and 


P = P + V Pu  V’ 
x x2  x b x 


P = V F. 
xb  x b 


(30) 


(31) 


so  that 


* 

Agee  and  Turner  (Ref.  5)  show  that  their  gain  condition  holds  if  Eq.  (31) 
is  satisfied.  Again  condition,  comparable  to  theirs,  does  not  fall  out  of 
our  derivation. 


-10- 


Theorem  7.  tr  P,  is  minimum  with  respect  to  G-,  iff,  (for 

—————  b c 

continuous -time  systems) 

G*  = F,  (HV  + C)'  R_1  (33a) 

4 D X 

or  (for  discrete-time  systems) 

G~(k+1)  = P S'(k+l)[S(k+l)  P S'(k+1)  + H P (k+1  | k)  H'  + R (k+1 )]" 1 (33b) 
2 b b x2 

where 

P (k+1  | k)  = A P (k|  k)  A ' + Q . (34) 

X2  X2 

Additionally,  the  optimal  error  covariance  matrix  for  P^  can  be  computed 
from  (for  continuous -time  systems) 

i>  = -P,  (HV  +C)1  R*1  (HV  + C)P  (35a) 

b b x x b 

or  (for  discrete-time  systems) 

Matrix  P , defined  in  Eq.  (16a),  is  associated  with  x for  ^ . 


P (k+1)  = [I  - G,(k+1)  S(k+i)]P. 


D.  Summary 

It  is  helpful,  at  this  point,  to  summarize  our  results.  We  let 

* _ 

- K^.  For  continuous -time  systems  we  have  shown  that 


b = -1^  (HVx+C)b  +Kb£(t|yQ) 


where 


' ~ iS1 1 -/0) " H* 

^(t)  = Pb  (HVx+C)'  R_1 


pb  = -pb(HVC)'  R_1  (nvx+c)  pb  * Pb(0) 


and  V (t)  is  computed  via  Eq.  (21a).  For  discrete-time  systems  we  have 
shown  that 


b(k+l)  = [ I - (k+1)  S(k+l)]b(k)  + (k+1)  y(k+l  \J) 


where 


J(k+l|y0)  = £(k+l \JQ)  - HAx 


Kjjtk+l)  = PbS'(k+l)[S(k+l)P  S'(k+1)  +H  P (k+1  |k)  H'  + R (k+1)]’1 


3b(k+D  = [I-  (k+1)  S(k+l)]P  (k)  ; P (o) 
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and  S(k+1)  is  computed  via  Eq.  (27),  which  uses  V (k)  from  Eq.  (21b).  In 

A 

both  cases,  x is  computed  via  Eq.  (22). 

Observe  that  the  bias -free  estimator,  x , couples  into  the  calcula- 
tion  of  l> , through  calculations  of  y ( . \^q)  and  V^;  but,  b in  no  way 
affects  x. 


E.  Estimators  x and  b are  the  Minimum-Variance  Estimators 

Kalman  (Refs.  10  and  11)  has  shown  that  a necessary  and  sufficient 
condition  for  an  estimator  (b,  or  x ) to  be  the  optimal  (i.e.  , minimum- 
variance)  estimator  is,  that  (for  continuous -time  systems) 


(44a) 


or  (for  discrete-time  systems) 


In  Appendix  D,  we  use  these  criteria  to  prove: 

A A 

Theorem  8.  Our  optimal  linear  estimators  of  x and  b , given  by 
Eqs.  (22)  and  (for  continuous -time  systems)  (36)  - (3  9),  or  (for  discrete- 
time systems)  (40)  - (43),  are  the  optimal  estimators  of  x and  b in 

With  this  result  we  have  completed  our  constructive  derivation  of 
Friedland's  decomposition. 


IV.  MULTISTAGE  ESTIMATION  OF  BIASES 


We  now  direct  our  attention  at  the  problem  of  adding  (or  deleting) 

bias  states.  We  wish  to  do  this  in  such  a manner  that  previously  computed 

quantities  can  be  used  to  obtain  the  new  estimates  of  x and  b in  d • Let 

rl  r?  — — 0 

b = col  (b  b2),  where  b ^ e R and  b2c  R Suppose  that,  initially, 

b = b ^ then  reduces  to  ^here* 

Xjfk+l)  = A x 1(k)  + Bl  b l + § ; x^o)  (45) 

„ b 1(k+l)  = bx(k)  = b L (46) 

I (k  I Jy)  - H X l (k)  + C l b l + £ (47) 


All  quantities  in  jt ^ are  analogous  to  those  in  J Q.  When  we  believe  that 
b = b L,  then  yfkj.^)  is  die  actual  measurement. 

» , 

Let  us  assume  thatx^  has  been  computed  for  as 


» M A * 

Sl  * Sl  + • 


where  x is  the  bias-free  estimate  of  x , , and  b . and  V are  computed 
— * —1  —1  x. 

X 

from  our  results  in  Section  III,  with  the  appropriate  substitutions,  that 
B = and  C = C^.  Now,  let  us  add  additional  bias  states,  b2>  so  that 
our  system  is  U Q,  with  b = col  (bj.b^,  B = (Bj  | B2),  and  C = (Cj  C2). 
We  wish  to  estimate  ti  in  using  die  already  computed  b * and,  we 

a * $ ““  * 

wish  to  compute  x by  means  of  Eq.  (22)  using  b and  the  already  computed 


This  part  of  our  paper,  which  deals  with  recursive  bias  estimation, 
is  presented  only  for  discrete-time  systems,  since  it  seems  to  be  most 
practical  for  such  systems. 


■naara 


The  key  to  solving  this  problem  is  to  recognize  that  b,  given  by 
Eqs.  (40)  - (43),  can  be  associated  with  the  following  auxiliary  parameter 
estimation  problem  (Ref.  16):  Obtain  minimum -variance  estimates  of  the 
constant  parameter  vector,  b,  from  the  (auxiliary)  measurement  equation 


r (k+1)  = S(k+1)  b + v (k+1) 


(49) 


where  v(k+l)  is,  by  definition,  a zero-mean  white  noise  sequence,  and 


Ef  v(k+i)  v '(k+1)}  = R (k+1)  + H(k+1)  P (k+1 1 k)  H'(k+1) 

*2 


(50) 


Actually,  £(k+l)  = J(k+l|^Q),  and,  we  see  from  Eq.  (41)  thatT(k+l)  can 
be  assembled  from  the  actual  measurement  £(k+l  |j^)  and  the  bias-free 
estimate  of  x , x ; i.e.. 


I (k+1)  = £(k+l|„/j  - HAx 


(51) 


Numerous  authors  (Refs.  4,  8,  and  14,  for  example)  have  developed 
algorithms  for  estimation  of  a constant  parameter  vector  which  are 
recursive  in  the  dimension  of  that  vector,  as  well  as  sequential  in  time. 
Mendel  (Theorem  3,  Ref.  4),  for  example,  shows  that 


b^k+1)  = b l (k+1)  - G(k+1)  b2(k+l) 


(52a) 


and 


- * 


b2(k+l)  = C(k+1)[  u2(k+l)  - G^(k+1)  bl  (k+1)] 


(52b) 


where  G(k+1),  C(k+1),  u2(k+l),  and  are  computed  from  other 


Our  statement  of  Eqs.  (52a)  and  (52b)  retains  Mendel's  notation;  matrix 
C(k+1)  in  (52b)  is  not  the  same  as  our  matrix  C. 


1 


; 
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equations  given  by  Mendel.  These  equations  are  not  needed  for  our 
purposes;  hence,  they  are  not  given. 

We  conclude,  therefore,  that  it  is  indeed  possible  to  estimate  b 

y A jjt 

in  using  the  already  computed  b This  is  accomplished  by  applying 
a parameter  estimation  algorithm,  which  is  recursive  in  the  dimension  of 
b^  and  sequential  in  time,  to  estimation  of  b for  the  auxiliary  measurement 
r (k+1). 

The  following  theorem  demonstrates  how  to  compute  x(k)  from 

^ A * a 

previously  computed  x^k)  and  b^k),  and,  fromb^k).  These  results  are 
similar  to  those  in  Bierman  (Ref.  19). 

Theorem  9.  Given  x^k)  and  b^k),  for  ^ State  estimate  x(k), 
for  JQ>  can  be  computed  from: 

£(k)  = x(k)  + V (k)b*(k)  + [V  (k)  - V (k)G(k)]  b_(k)  (53) 

Xi  x2  xi 

where,  for  i = 1,2, 

V (k+1)  = [i-K  (k+1)  H(k+1)][A  V (k)  + B. ] 

X,  X X*  1 

- K (k+1)  c (k+1)  ; V (0)  = 0 , (54) 

X 1 x# 

I 

and,  b^fk)  an<^  G(k)  are  computed  from  an  algorithm  which  is  recursive 
in  the  dimension  of  ^ and  is  sequential  in  time,  such  as  the  one  in  Mendel 
(Theorem  3,  Ref.  4). 

The  proof  of  this  theorem  is  given  in  Appendix  E.  It  is  important 
to  understand  that  x(k)  = x ^(k);  i.e.  , adding  (or  deleting)  biases  does  not 
affect  the  "bias-free  estimates."  We  see,  from  Eq.  (53),  that  x (k)  does 
not  have  to  be  completely  recomputed  when  biases  are  added.  In  fact, 
another  way  to  write  Eq.  (53)  is,  as 


. 

I 


! 


i 
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(55) 


x (k)  = x.  (k)  + [ V (k)  - V (k)  G(k)]  b,(k) 

— —i.  x2  1 ~~C 

which  makes  this  even  more  evident. 

V.  CONCLUSIONS 

We  have  rederived  Friedland's  decomposition,  Eq.  (8),  by  means 
of  a constructive  approach,  which  lends  itself  to  studying  extensions  of  his 
results  to  more  complex  systems,  where  biases  are  themselves  modeled 
as  first-order  Markov  processes  (Ref.  17,  for  example).  These  extensions 
have  been  made  and  will  be  reported  on  in  a future  publication.  Our 
constructive  approach  gives  some  insight  and  physical  meaning  into 
certain  quantities,  such  as  V , which  appeared  in  Friedland's  solution. 

The  key  to  our  constructive  approach  is  die  artificial  system,  J ; for, 
through  that  system,  we  are  able  to  make  connections  between  x and  x_ 
as  well  as  x and  x 

As  a consequence  of  our  approach,  we  showed  that  b can  be  given 
the  interpretation  of  a minimum -variance  parameter  estimator  which  is 
associated  with  an  auxiliary  measurement.  With  this  interpretation,  we 
can  use  well-known  algorithms  for  estimating  the  bias  which  are  recursive 
in  the  dimension  of  the  bias  as  well  as  sequential  in  time.  Hence,  we 
have  shown  that  it  is  possible  to  add  (or  delete)  bias  states  in  such  a 
manner  that  previously  computed  quantities  can  be  used  to  obtain  new 
estimates  of  x and  b. 
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APPENDIX  A.  PROOFS  OF  PRELIMINARY  RESULTS 

The  proofs  presented  in  this  and  subsequent  appendices  are  for 
the  continuous -time  case.  It  is  felt  that  these  proofs  give  the  essence  of 
the  constructive  method  in  a clearer  fashion  than  those  of  the  discrete-time 
case.  Many  of  the  discrete-time  proofs  are  inductive  in  nature. 

1.  Proof  of  Theorem  1 

From  Eqs.  (11)  and  (17),  it  follows  that 

d 

^-(x2+Ab)  = x2+Ab  = A(x2+Ab)+Bb+  ; (A-l) 

but*  *2^  +A(°)b  = *2<0)  hence,  x2+ Ab  satisfies  the  same 

differential  equation  (d.  e. ) as  x.  By  uniqueness,  therefore,  x?  + Ab  = x, 
which  is  Eq.  (18). 

2.  Proof  of  Theorem  2 

From  Eqs.  (13)  and  (19a),  it  follows  that 

3T(*2+  Ab>  = iz  + Ab 

= (A  - KxH)(x2+  Ab)  + Kx[  +HAb  +Cb]  (A-2) 


Additionally,  from  Eqs.  (3),  (12),  and  (18),  we  see  that 


£(t \J0)  = ^(t|y2)+HAb+  Cb 


(A -3) 


so  that  Eq.  (A-2)  can  also  be  written,  in  terms  of  y(t|  ^1).  as 
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3T(£2+^-)  = ^ - Kx  H)  (x2+ A b)  + ^(t|  ^)  ; (A -4) 

but,  x2(0)  +A(0)  b = x2(0)  = x(0);  bence,  x£  + /T  b satisfies  the  same 
d.  e.  as  x . By  uniqueness,  therefore,  £2+Ab  = x,  which  is  Eq.  (20). 

3.  Proof  of  Theorem  3 

To  begin,  observe  from  Theorems  1 and  2 that  V (0)  = A(0)  - A*(0)  = 0. 
Then,  using  Eqs.  (17)  and  (19a),  we  see  that 

Vx  * A - A = (A  - Kx  H)  + (B  - C)  , (A -5) 


which  is  Eq.  (21a). 


APPENDIX  B.  UNBIASEDNESS  PROOFS 


1.  Proof  of  Theorem  4 


From  Eqs.  (2a)  and  (23),  we  see  that 


--(b-b)  = -Glb  - G2l(t|y0)-G3x 


(B-l) 


Substitute  Eqs.  (3),  (18)  and  (20)  into  Eq.  (B -Intake  the  expected  value  of 
the  resulting  equation,  and  use  the  fact  that  x^  is  an  unbiased  estimator 
of  x2  , to  show  that 

~Etb-b}  = -G1  Ejb}  - (G2HA  + G2C  + G3  A)b 


•(G2H  +G3)  E [ x 2 } 


(B-2) 


To  prove  sufficiency,  assume  Efbj  = E|bJ;  then,  from  Eq.  (B-2), 


g1+g2ha  + g2c  +g3  A = 0 


(B-3 ) 


G2H  + G3  = 0 


(B-4) 


Solve  Eq.  (B-4)  for  G3  to  obtain  Eq.  (26a);  substitute  that  expression 

into  Eq.  (B-3);  and  use  the  fact,  from  Theorem  3,  that  V = A - A , to 

x 

show  that 


Gx  = -C2  (H  Vx  + C)  , 


(B-5) 


which  is  Eq.  (25a). 


To  prove  necessity,  substitute  Eqs.  (25a)  and  (26a)  into  Eq.  (B-2), 


to  see  that 
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~E{  b - b } = G1  E(  b - b } 


(B-6) 


Because  of  Eq.  (24),  the  unique  solution  to  this  equation  is  E{b  - b } = 0; 

A ""  ™ 

hence,  b is  an  unbiased  estimate  of  b. 


2.  Proof  of  Theorem  5 

From  Eqs.  (18),  (8),  and  (20),  and  Theorem  3,  we  see  that 


x-x  = (*2-x2)  + Vx  (b  - b) 


(B-7) 


* A 

Now,  x is  the  optimal  (unbiased)  estimate  of  x , so  that  E{x  - x } = 0; 
““  » ”2  —2  —2  — 

hence , Ef  x - x } = £,  which  proves  that  x is  an  unbiased  estimate  of  x. 
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APPENDIX  C.  OPTIMIZATION  PROOFS 


1.  Proof  of  Theorem  6 


To  begin,  we  show  that  if  P (0)  = 0,  then  P,  =0  for  all  t.  We 

bx2  bx2 


obtain  a differential  equation  for  P by  differentiating  Eq.  (16b), 

“*2 


*b*2=  Eti.iieJ  + E'i.^eJ 


(C-l) 


From  well-known  filter  results  (Ref.  20.  for  example),  we  know  that 


iz.  = <A-KxH>i2. + 1 - Kxl 


(C-2) 


Additionally,  from  Eqs.  (14c),  (2a),  and  (28a),  we  find  that 


b = -G,[  (HV  + C)  b + H x _ + 7)  ] 

— e 2 x — e — 2e  — J 


(C-3 ) 


It  is  then  relatively  straightforward  to  show  that 


E(Se£ie>  = P - G2H  P + iG'2B  K;  (C-4) 

to  to 


and 


1 


E[  b x ' } = P,  (A  - K H)'  + - G,  R K' 
— e — 2e  bx_  x 2 2 x 


(C-5) 


Substituting  Eqs.  (C-4)  and  (C-5)  into  Eq.  (C-l),  making  use  of  the  fact 


that  HP^j  RKx'  we  see  that  Pbxo  satisfie3  the  following  linear  homo- 


geneous d.  e. : 
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P,  • = -G,(HV  +C)  Pv  + Pu  (A  - K H) 
bx2  2 x b*2  bx2  x 


(C-6) 


Since  P,  (0)  = 0,  it  follows  that  P,  =0,  all  t. 

^2  **2 

Next,  we  develop  the  expression  for  P^  in  Eq.  (30).  From  Eq. 
(B-7),  we  see  that 


P = P + V Pu  + P . V'  + V P^  V'  , 
x x2  x bx2  x2b  x x b x 


(C-7) 


and  this  expression  reduces  to  Eq.  (30),  since  P,  = 0. 

b*2 

In  a similar  manner,  it  follows  from  Eqs.  (B-7)  and  (29),  that 


P . = Pu  + V P = V Pu 

xb  bx2  x b x b 


(C-8) 


which  is  Eq.  (3  1). 

Finally,  to  obtain  Eq.  (32)  for  P(t),  substitute  Eqs.  (30)  and  (31) 
into  the  right-hand  side  of  Eq.  (15)  and  collect  the  terms  as  shown. 

2.  Proof  of  Theorem  7 


We  seek  to  minimize  tr  P with  respect  to  G_,  where 

D Z 


. P.  = -G-  (HV  + C)  P.  - P.  (HV  + C)'  Gi  + G.R  Gi 
b c x bbx  222 


(C-9) 


Equation  (C-9)  follows,  in  the  usual  manner,  from  Eqs.  (C-3)  and  (29). 
Our  approach  is  the  matrix  minimum  principle,  in  the  manner  of  Ref.  12, 
applied  to  the  following  Hamiltonian,  H: 
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H = tr  [ Pb(t)  T(t)]  (C-10) 

From  the  condition  that  BH/BG2  = 0»  and  gradient  matrix  operations,  we  see 
tbat  tr  is  minimum  with  respect  to  G2>  if 

C2=  Pb(HVx+C)'R'1  ' (C-ll) 


which  is  Eq.  (33a). 

Equation  (3  5a)  follows  upon  substitution  of  Eq.  (C-ll)  into  Eq.  (C-9). 


To  prove  Theorem  8,  we  use  the  orthogonality  principle  (Refs.  10 
and  11)  stated  in  Eq.  (44a).  Because  the  details  of  the  proof  are  somewhat 
intricate,  and  do  not  contribute  to  further  understanding  of  this  paper's 
problems,  we  give  a brief  sketch  of  the  proof.  Details  can  be  found  in 
the  second  author's  Ph.  D.  dissertation  (Ref.  18). 

From  Eq.  (B-7),  we  see  that 

I'(t|/0)}\  /v\ 

x’<t i-fy  ■ * l o /V1  y f-«2-'(T|v°)1 

(D-l) 

Using  Eq.  (A-3)  and  the  fact  that  x is  the  optimal  estimate  for  y^(t|^/’^)> 

it  follows  that  E[  x y'(T  | JQ)}  = 0,  all  0 < t < t.  To  show  that 

E{  b y'(r  | Jn)}  - 0,  all  0 < r < t,  consider  that  expectation  first  for 

0 < T < t and  then  for  r = t.  In  the  former  situation,  show  from  Eq. 

(C-3)  thatEfb  y'  (t  | «/  )}  satisfies  a linear  homogeneous  d.  e.  with  zero 
— e — 0 

initial  conditions;  hence,  E{b  y 1 ( t | Jn)}  - 0,  for  0 < t < t.  For  the 

— e — u — — 

latter  situation,  use  the  fact  that  b is  the  optimal  estimate  for  observation 

equation  (37);  hence,  E[b  (t)  y ' (t  | J n) } = 0,  so  that 

— e — u 

E{  b (t)y'(tlVj}  = E{  b x'(t)}H'(t)  (D-2) 

— e U e — 

Finally,  express  x(t)  as  a linear  transformation  of  the  measurements 
over  the  interval  [0,t]  and  show  that  the  right-hand  side  of  Eq.  (D-2)  is 
zero.  By  these  arguments,  we  have  shown  that  our  results  give  the 
optimal  linear  estimators  of  x and  b.  Because  all  noise  processes  are 
gaussian,  it  then  follows  that  our  results  give  the  optimal  estimators  of 
x and  b (Ref.  20). 
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